Picture for Hung-yi Lee

Hung-yi Lee

AQAScore: Evaluating Semantic Alignment in Text-to-Audio Generation via Audio Question Answering

Add code
Jan 21, 2026
Viaarxiv icon

AQUA-Bench: Beyond Finding Answers to Knowing When There Are None in Audio Question Answering

Add code
Jan 18, 2026
Viaarxiv icon

On the Fallacy of Global Token Perplexity in Spoken Language Model Evaluation

Add code
Jan 09, 2026
Viaarxiv icon

Style Amnesia: Investigating Speaking Style Degradation and Mitigation in Multi-Turn Spoken Language Models

Add code
Dec 29, 2025
Viaarxiv icon

AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning

Add code
Dec 18, 2025
Viaarxiv icon

ParaS2S: Benchmarking and Aligning Spoken Language Models for Paralinguistic-aware Speech-to-Speech Interaction

Add code
Nov 11, 2025
Viaarxiv icon

Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition

Add code
Oct 09, 2025
Viaarxiv icon

Full-Duplex-Bench-v2: A Multi-Turn Evaluation Framework for Duplex Dialogue Systems with an Automated Examiner

Add code
Oct 09, 2025
Viaarxiv icon

SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models

Add code
Oct 08, 2025
Figure 1 for SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Figure 2 for SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Figure 3 for SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Figure 4 for SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Viaarxiv icon

Hearing the Order: Investigating Selection Bias in Large Audio-Language Models

Add code
Oct 01, 2025
Viaarxiv icon